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ABSTRACT 

Recent genomic analyses on the cellular metabolic network show that reaction flux 
across enzymes are diverse and exhibit power-law behavior in its distribution. While 
one may guess that the reactions with larger fluxes are more likely to be lethal under 
the blockade of its catalyzing gene products or gene knockouts, we find, by in silico 
flux analysis, that the lethality rarely has correlations with the flux level owing to the 
widespread backup pathways innate in the genome-wide metabolism of Escherichia 
coli. Lethal reactions, of which the deletion generates cascading failure of following 
reactions up to the biomass reaction, are identified in terms of the Boolean network 
scheme as well as the flux balance analysis. The avalanche size of a reaction, defined as 
the number of subsequently blocked reactions after its removal, turns out to be a useful 
measure of lethality. As a means to elucidate phenotypic robustness to a single dele- 
tion, we investigate synthetic lethality in reaction level, where simultaneous deletion of 
a pair of nonlethal reactions leads to the failure of the biomass reaction. Synthetic lethals 
identified via flux balance and Boolean scheme are consistently shown to act in parallel 
pathways, working in such a way that the backup machinery is compromised. 
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1 Introduction 



Complex machinery of cellular metabolism occurring in a living organism makes up a 
part of autocatalytic network of biochemical reaction pathways. The reaction network 
in itself constitutes an intricate web in such a way as sharing intermediates. Yet an- 
other dimension of complexity comes from the tight control of reactions by functional 
proteins which are again under transcriptional, translational control as well as degrada- 
tive and other inductive regulations, making even a single pathway analysis formidable 
task. Only recently, advances in high-throughput experiments and the computing power 
incorporating diverse data sets collected in genomic research make it possible to con- 
struct cellular networks of metabolism in genome- wide perspectives. At the same time, 
many quantitative theoretical methods including graph theories and other mathematical 
tools developed from diverse disciplines attract much attention to tackle the large-scale 
networks (Xia et al., 2004; Barabasi & Oltvai, 2004). 

In the early graph-theoretic approaches to the metabolic network, attention has been 
paid to the so-called scale-free feature of topological structure (Jeong et al., 2000), 
small-world-ness (Wagner and Fell, 2001), modularity (Girvan and Newmann, 2002) 
and hierarchical organization (Ravasz et al., 2002). Despite the immanent specificity 
in cellular functions of various organisms, the connectivity, or number of connections 
each node (metabolite or associated reactions) has, is generally far from homogeneous. 
In particular, this connectivity distribution of the metabolic network, as shared by many 
naturally occurring complex networks, follows a power law, meaning large deviations 
in spite of well defined average value. It is this context that borrows the term scale- 
free network, where hubs, nodes with large number of connections, play essential roles. 
When such hubs are removed or turned off, the whole system becomes vulnerable. In- 
deed, it was found (Jeong et al., 2001) that, for the yeast protein interaction network, 
hub proteins are more likely to be lethal than the others. 

In the framework of networks, metabolic reactions and participating metabolites 
can be mapped into alternating nodes, where the outward(inward) connections from 
a reaction node indicate that those metabolites are produced(consumed) as a result of 
the reaction. Once constructing a directed bipartite graph in this way, we calculate 
graph-theoretic quantities that characterize the global topology and give a clue to as- 
sessing lethality of metabolic reactions. Then, we examine the correlations between the 
metabolic flux level and the lethality of each metabolic reaction using the flux balance 
analysis[FBA] (Edwards and Palsson, 2000). Here, by lethal, we mean the organism 
could rarely synthesize the indispensable biomass, or the flux of the biomass reaction 
is significantly reduced when that reaction is blocked or removed from the network. 
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mimicking gene knockout experiments*. One seemingly counterintuitive result is the 
absence of correlation between flux level and lethality (Fig. 2), which is related with the 
fact that the high-flux reactions have abundant bypasses or backup pathways. 

We also invoke the Boolean network scheme, an idealization of the metabolic net- 
work as a wiring of binary logic gates to elucidate the pathway structure of the network 
on the logical basis. Considering the knockout and consequent cascading failure in the 
metabolic reaction network as an avalanche, we investigate the distribution of avalanche 
size defined as the number of reactions subsequently turned off on account of the re- 
moval of a target reaction to find it a good measure of lethality. The distribution follows 
a power law with the characteristic exponent around 2.5, pervasive throughout disparate 
model systems having self-organized criticality (Bak, 1996). 

In the latter part, we investigate the effects of simultaneous deletion of multiple re- 
actions with a view to elucidating the interaction between them. Though only a small 
portion of reactions lead to distinctive phenotype under their single deletion, it is non- 
trivial to make a prediction on how destructive a dual deletion of nonlethal reactions. 
Actually, the effect can be aggravating or alleviating as well as simple sum of each, de- 
pending on their role played in the otherwise intact metabolic network. In particular, 
when a pair of nonlethal reactions are deleted to make no growth of cell, we call it syn- 
thetic lethal, metabolic homologue of the same term in genetics. Synthetic lethality in 
metabolic network is a manifestation of their complementary nature responsible for the 
buffering between alternative parallel pathways. We show the synthetic lethal pairs are 
distributed over distinct the avalanche size of a pair of reactions is strongly correlated 
with its synthetic lethality. 



2 Materials and Methods 

We use, with minor curation, the recent revision of in silico model E. coli (Reed et al., 

2003), which was obtained by searching databases, such as LIGAND (http://www.genome.jp/kegg/ligand.html I, 
EcoCyc (http://www.ecocyc.org), TC-DB (http://tcdb.ucsd.edu/), and referring to up- 
dated literatures on sequence annotation (Serres et al., 2001). To mimic random or 
targeted mutation strains, a specific reaction is removed from the network and the resul- 

*More recently, Segre et al. proposed alternative scheme phrased as "minimization of metabolic ad- 
justment[MOMA]" for the phenotypic prediction of deletion mutants. Instead of assuming optimality 
in growth yield of deletion mutants, MOMA approximates metabolic phenotype by performing distance 
minimization in flux space, whereby the correlation with experimental results are improved. See Segre 
2002. 
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tant metabolic capabilities are to be assessed. For this purpose, we introduce a single 
pivotal reaction, the biomass reaction, originally formulated as a linear combination of 
essential metabolic reactions giving rise to the growth of the organism (Neidhardt and 
Umbarger, 1996). Throughout the study, lethality of a certain reaction is determined by 
the flux of this biomass production, which is contingent to the ansatz of optimality that 
the selection pressure has imposed in the long history of evolution. 



2.1 Metabolic Network as a Graph 

The overall map of metabolic reactions we study is a bipartite graph, composed of two 
different types of nodes, 627 metabolites and their participating 1074 metabolic reac- 
tions including transport and exchange events. One type of nodes connect only to the 
other type of nodes in the networks^ Each link between a pair of a metabolite and 
a reaction is directed, reflecting the metabolite is either consumed (substrate) or pro- 
duced (product) or both in reactions. Of the 1074 reactions, 254 reversible reactions 
are decomposed into two separate reactions catalyzed by the same enzyme, and possible 
multiple connections between metabolites via isozymes are taken as a single directed 
edge in the graph representation. 627 distinct metabolites have either their intracellular 
or extracellular version or both, which sum into 761 distinct nodes of metabolites. Once 
the metabolic network is reconstructed as a graph, we quantify, by various numerical 
measures, the lethality of each node in the wild-type strain and compare them with those 
of knockout mutants. 



For each metabolite in the metabolic reaction network, dynamic flux balance condition 
on the concentration Xi can be written as 



where <pj is the corresponding reaction rate (outward flux), and Sij are stoichiometric 
coefficients involved in the metabolite i E {1, 2, ■ ■ ■ , m} participating in the reaction 
j E {1,2, - ■ ■ ,n} including transport and exchange reactions also. As an alternative 

^Depending upon the objectives, it can be projected to recover either of the single-mode metabohte 
network or reaction (enzyme) network. 



2.2 Flux Balance Analysis (FBA) 




(1) 
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to yet intractable kinetic models in genome-wide perspectives, mass conservation can 
be applied in the balanced state to give the stationary condition J2j = for 

all i. Stoichiometric matrix encodes the topology of the metabolic network and plays 
the role of weighted adjacency matrix in the corresponding graph. Once the stoichio- 
metric coefficients are given for all reactions, we have, in general, under-determined 
situation(m < n) where huge degeneracy in the null space of Eq. dU) is unavoidable. 
Since, in reality, enzymatic capacity cannot be arbitrarily large, flux values are also con- 
strained to vary in limited range, which can be estimated via, say, the Michaelis-Menten 
type kinetics. Actually, this multiplicity in the feasible metabolic state can be consid- 
ered a manifestation of the capability of a metabolic genotype. It makes physiological 
sense in that cells are expected to adapt themselves responding to different stresses and 
growth conditions. Among those feasible metabolic phenotypes, FBA assumes the ex- 
istence of optimized point(s) of a certain objective function, called biomass reaction or 
growth flux, by utilizing further constraints based on the thermodynamic irreversibility 
of reactions. 



2.3 The Boolean Network 

Metabolic reactions or genetic switches are seldom turned on or off. Instead, they can 
change by some fold, either up- or down-regulated, to make a physiological payoff. As a 
first approximation. Boolean idealization has long been introduced to reconstruct the ge- 
netic regulatory network as a directed graph (Kauffman, 1969) and was recently adopted 
to model the metabolic network (Lemke et al., 2004). In Boolean reconstruction of the 
metabolic network, each node in the graph is replaced with binary logic gate AND or 
OR. A metabolite would not cease to be existent in the network until all the reactions 
having that metabolite as a product are blocked, while a reaction does not take place any 
more with only a single absence out of substrate metabolites, which renders the metabo- 
lites Boolean disjunction, OR, and reactions conjunction, AND as exemplified in Fig. 1. 
With all nodes given an initial condition, we iteratively relax the network till the network 
is settled in a fixed point. 



2.4 Cumulative Lethality Score 

As a way to reveal the correlation between avalanche size and lethality, we use the in- 
dex, cumulative lethality score (CLS), the original version of which was introduced in 
the context of protein essentiality prediction (Jeong et al, 2003). Once a measure of 
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lethality, say the flux level of each reaction, is proposed, we can make a serial list of re- 
actions assorted in descending order of the proposed lethality measure. With the lethality 
criteria determined through the viability under the single deletion of each reaction, we 
can assign a binary lethality score, one or zero, into each reaction depending on whether 
it is lethal or not. Summing up the binary scores from the first rank to the jth rank in the 
value of the proposed quantity, we get the CLS, say, L(j). If the proposed quantity is 
positively(negatively) correlated with the lethality, i.e. reactions holding high ranks are 
dominated by the binary lethality score one(zero), the normalized CLS manifests itself 
as a concave(convex) curve. Otherwise, if it is a random sequence of zero's and one's, 
L{j) is given by a straight line. That is, the more correlated is the measure with the 
lethality, the higher curvature L{j) develops. 



3 Results and Discussion 



3.1 Flux Level versus Lethality 

Under aerobic condition with nine distinct carbon sources, we identify 208 (19.6%) 
common lethal reactions, the single blockade of which suffocates the biomass produc- 
tion^. It is similar in fraction to the essential fraction of S. cerevisiae genome, 18.7% 
(Tong et al., 2001; Winzeler et al., 1999; Giaever et al., 2002). 

The flux distribution in the wild-type metabolic network follows a power-law in 
Pareto's form, F((/>) ~ (0 + f/'o) "- Metabolic traffic is concentrated along a few 'super- 
highway' reactions, while the vast majority of reactions are in charge of only a small flux 
(Almaas et al., 2004). In the meantime, inspired by the roles the main arteries, principal 
roads, or backbones play in blood circulation, transportation, or data communications, 
we examine the possibility that the flux level should reflect the lethality of each reaction. 
Fig. 2 shows that the plausible correspondence between flux level and lethality is a mere 
conjecture to prove not true. In other words, there is no correlation at all between those 
quantities, and high flux itself has nothing to do with the lethality of a reaction, which 
obviously contradicts our intuition. We also investigate the flux redistribution profile 
upon deletion of a high-flux reaction. Reaction fluxes are redistributed either locally or 

^Throughout the analysis, acetate, alpha-ketoglutarate, glucose, glycerol, (L-,D-)lactate, L-malate, 
pyruvate, and succinate are used as the carbon sources. Though the lethality of a reaction does depend on 
which are used as carbon sources, 208 lethal reactions are shared by every sources and only 22 reactions 
show nutrient-dependent lethality. 
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globally. Here, by local, we mean the case that very few reactions, having almost zero 
flux in the wild type, fully take over the flux of the reaction deleted. However, in the 
global redistribution, the flux of the removed reaction is shared over a large number of 
reactions to keep optimal biomass production. 



3.2 Avalanche Size versus Lethality 

In the Boolean reconstruction of the E. coli metabolic network, 41 lethal reactions are 
identified, which are all lethal in FBA also. It is no wonder, if we consider that lethality 
in binary scheme is minimal and more stringent than in the weighted version of FBA. 
To quantify the effect of a single-node deletion, we define the avalanche size of each 
reaction, in the Boolean scheme, as the number of reactions subsequently turned off on 
account of the removal of a target reaction. 

As shown in the inset of Fig. 3, the avalanche size distribution for E. coli metabolic 
network also displays a power law behavior, implying there exist a few reactions whose 
deletion triggers a large destructive avalanche cascade in metabolic reaction. In math- 
ematical aspects, at criticality, there exists an infinite spanning cluster enough to reach 
the ultimate destination, biomass. Thus, avalanche size could be useful to find lethal 
reaction. Indeed, the reaction which generates a bigger avalanche size is more likely to 
be lethal as seen in the CLS plot (main frame of Fig. 3). 

One may want to check the possibility that nearer nodes to the biomass reaction 
should be liable to be lethal. However, of the 374 distinct reactions that produce the 
substrates of biomass reaction, only 13 of them are lethal in the Boolean scheme, making 
the identification of lethal reactions nontrivial — proximity to the biomass reaction has 
nearly nothing to do with the lethality of a reaction. 

The notion of the avalanche in the network can be extended to the FBA scheme, 
where the avalanche size of a reaction is defined as the number of reactions whose flux 
levels under its knockout differ from those in the wild type. Actually, we used three 
different criteria of the avalanche: (i) reactions are newly turned on or off, (ii) flux level 
change exceeds an arbitrary cutoff value, and (iii) fractional changes in flux exceeds the 
cutoff value. There is, however, little difference among the different counting schemes. 
Fig. 4, drawn by using the criterion (i) above, reveals the absence of correlation between 
the flux level and the avalanche size both in the Boolean scheme and in FBA, which is 
consistent with the fact that flux level is irrelevant to assessing lethality. (Fig. 2) 

Due to the small-world-ness of the complex network, local perturbations are liable 
to propagate to the whole network leading to the sharing of load, which underlies the 
system-wide high flexibility. At the same time, because of the scale-free-ness, avalanche 
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cascade can either be long-ranged by making a large number of nodes bear parts of the 
'expenses' or be absorbed at a short distance from the source of perturbation, depending 
on the detailed functional characteristics of the reaction. In other words, the response to 
single deletion perturbation of a single reaction are too diverse to definitely predict how 
they would be, and so they can be predicted in a probabilistic way (Kim et al., 2003). 
Identification of lethal reactions in metabolic network can be viewed in the same footing. 
The effects of a node removal or a gene deletion are largely negligible as a whole, which 
is manifested by the dominance of nonlethal reactions. However, even if a reaction is 
not lethal, its potential damage to the network varies. It is this insufficiency in lethality 
assessing that raises the need for the knowledge of synthetic lethals in the next section. 



3.3 Synthetic Lethality in Metabolic Network 

The close genetic relationships between genes which underlie the functional buffering 
has been associated with the notion of synthetic lethality. It has been assessed in a high- 
throughput manner by the synthetic genetic array(SGA) analysis (Tong et al., 2001). 
Likewise, analysis of multiple-deletion mutants in genome-scale metabolic network may 
shed light on novel topological features of backup pathways leading to the robustness. 
In the restricted level of metabolism, such relationships can be revealed more clearly by 
performing the double reaction knockout experiments, which can be easily performed 
in silico. Furthermore, such synthetic lethal pairs identified allow us to track the backup 
pathways and to visualize the precise genomic origin from which the metabolic pheno- 
typic stability arises. 

When the glucose is used as a carbon source in aerobic condition, 55 synthetic lethal 
pairs are identified out of all possible pairs of reactions(Table I). Relatively small number 
of synthetic lethal pairs suggests high density of backup pathways for a given specific 
condition in the metabolic network and is in accordance with the case of the yeast (Papp 
et al, 2004). Among these, 33 (60%) pairs are involved in the same subdivision of 
the reaction categories. As expected, most of those homofunctional synthetic lethals 
usually work as the 'simple' backup pathway: One of the pair is never used in the 
wild type but it almost completely takes over the flux of the blocked reaction, which 
amounts to 64% of total synthetic lethal pairs, as depicted in Fig. 5(a). We note, however, 
the greater part of such simple takeover case might be attributed to an artifact of the 
optimization scheme. Since there can be multiple solutions yielding the same maximized 
growth rate, and one cannot select a specific solution over the others without relevant 
physiological constraints, such as regulations in transcription level. Nevertheless, the 
effect of degeneracy on the (synthetic) lethality assay is not harmful since the deletion 
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of lethal reaction unambiguously indicates no feasible solutions for nonzero biomass 
production. Interestingly, the homofunctional synthetic lethals of the other type, for 
which both the reactions are operational in the wild type, are mostly involved in the two 
subsystems of the pentose phosphate cycle and threonine-lysine metabolism. Excluding 
these particular cases, 91% of homofunctional synthetic lethals are simple, while, for the 
other 22 heterofunctional synthetic lethals, only 9 (41%) of them are the simple backup 
pathways. In total, 25 pairs were both operational in the wild type and the remaining 30 
are paired in one used and one unused in the wild type. 

We also investigate the synthetic lethality in the Boolean scheme, where 37 pairs 
of synthetic-lethal doublets are identified in addition to 41 lethal singlets. Contrary to 
the lethal singlets which are dominated by cell envelope biosynthesis (78%), synthetic 
lethals are scattered throughout diverse functional categories (Fig. 7). 

Conserved across both the network scheme is the proximity of the two reactions 
constituting a synthetic-lethal pair. Fig. 6 illustrates the distance distribution for the 
synthetic-lethal reaction pairs. As shown in the inset of Fig. 6, 94.5% (97.3%) of 
synthetic-lethal pairs in FBA(Boolean) scheme are apart in three steps and below, which 
stands out sharp contrast to the fraction 42.4% for all pair of metabolites. If we elim- 
inate, to make better sense of biochemical reaction pathways, currency metabolites or 
cofactors from the network, the fraction of synthetic-lethal pairs whose separation is 
three steps and below are 96.2% and 91.1% in FBA and Boolean scheme, respectively 
(main plot). Whether the analytical scheme is Boolean or FBA, long-ranged complicity 
of synthetic lethals through the intermediary of a 'filamentary' single pathway like in 
Fig. 5(b) comprises only a small fraction(<5%) of synthetic-lethal pairs, and is rather 
an exception. 

Another important outcome regarding synthetic lethality is already shown in Fig. 3, 
where we measure the avalanche size of the synthetically lethal doublets and triplets 
in addition to that of the singlets. Synthetic lethal multiplets give rise to even higher 
correlation of their avalanche size with the lethality. In effect, robustness in metabolic 
network stems from redundancy in branched and parallel pathways. Conversely, lack of 
reaction pathways, whether it is due to the innate biochemical nature or to the incom- 
pleteness of pathway database, lead to vulnerability. Hence, the more we know about a 
reaction pathway, the less probable it should contain lethal reactions. In particular, we 
cannot completely rule out the latter possibility: A quarter of the E. coli genome is yet to 
be functionally assigned. Accumulated bias in molecular biology research, if any, might 
be considerable to our result that central pathway across the species, such as the citrate 
cycle or the glycolysis pathways, have very few lethal reactions (Fig. 7). However, at 
least for E. coli, one of the best known bacteria yet studied, there are no good reasons to 
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suspect such a bias. Moreover, our analytical results are compatible with the fact that a 
wide-spread strategy of antimicrobials is acting against cell wall synthesis(fosfomycin, 
cycloserine) or integrity (lysozyme). Rather to be supposed is that the more important a 
reaction is, the better facilitated its backup pathways have been during evolution. 



4 Summary and Outlook 

Systematic deletion study in a genome-wide view of model organisms help reveal the 
organizing principles of the metabolic network and may shed light on how the selection 
has been embodied at the network levels, and especially the recent controversy sur- 
rounding the causes and evolution of the enzyme dispensability (Papp et al., 2004). As 
an index quantifying lethality in the graph-reconstructed metabolic reaction network, we 
propose the avalanche size of each reaction, the number of 'dead' reactions due to the 
knockout of that reaction or its related gene products to show an even more remarkable 
interdependence than various measures yet proposed. By identifying synthetic lethals or 
lethal multiplets in the genome-scale metabolic network under controlled environments, 
we see the emergence of phenotypic stability supported by rich backup pathways, which 
is shared by diverse realization of complex networks with scale-free nature. Studies on 
multiple deletion mutations in metabolomic interaction network can also be applied to 
natural metabolomic variations, reminiscent of single-nucleotide polymorphism, giving 
rise to practical buffering or phenotypic robustness under targeted mutations (Tong et 
al., 2001; Tucker and Fields, 2003; Ooi et al., 2003). Furthermore, if we incorporate 
network analyses on metabolism with the gene-protein-reaction associations by includ- 
ing the other sectors of intra- and inter-cellular networks, it can be used in designing new 
microorganismal strains on the computer truly beyond the reductionist perspectives. 



Note Added at Proof 

After completion of the present study, we learned of the work by Burgard et al. (2004), 
who introduced seemingly parallel concept of flux coupling analysis to our avalanche 
analysis in FBA scheme. Originally, the metabolic flux 0i was defined as (directionally) 
'coupled' to (j)2 when a nonzero flux value 02 implies a nonzero flux value (pi in any 
stationary states of the system. Taking the contraposition, so as to interpret it in the 
context of the lethality, 0i is coupled to 02- Then the zero flux of (pi implies the zero 
flux of 02- While the avalanche size is correlated with the number of fluxes coupled with 
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the biomass reaction, they are not mathematically identical, because the former refers 
only to the optimal flux distribution, whereas the latter covers all possible stationary 
states. 
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Appendices 

Abbreviation lists for metabolites and reactions (enzymes) in Fig. 1, 5 and Table I. 



4r5:iu 


4-{ l-D-Ribitylamino)-5-aminouracil 


h 


H+ 


adp 


ADP 


h2o 


H2O 


akg 


2-Oxoglutarate 


icit 


Isocitrate 


atp 


ATP 


mal-L 


L-malate 


cdp 


CDP 


nadh 


Nicotinamide adenine dinucleotide, reduced 


cit 


Citrate 


nadph 


Nicotinamide adenine dinucleodde phosphate, reduced 


co2 


CO2 


oaa 


Oxaloacetate 


coa 


Coenzyme A 


pi 


Phosphate 


ctp 


CTP 


ppi 


Diphosphate 


dMp 


3,4-dihydroxy-2-butaiione 4-phosphate 


qs 


Ubiquinone- 8 


dcdp 


dCDP 


q8h2 


UbiquinoI-8 


dctp 


dCTP 


libiiv 


Riboflavin 


dmlz 


6,7-Diiirethyl-8-( 1 -D-ribityl)lumazine 


succ 


Succinate 


fad 


FAD 


succoa 


Succinyl-CoA 


fadh2 


FADH2 


trdox 


Oxidized thioredoxin 


fmn 


FMN 


trdrd 


educed thioredoxin 


fum 


FLimarate 






ACONT 


aconitase 


MTHFC 


methenyltetrahydrofolate cyclohydrolase 


ADHEr 


Acetaldehyde dehydrogenase 


MTHFD 


methylenetetrahydrofolate dehydrogenase (NADP) 


ADK 


adenylate kinase 


MTHFR2 


5,10-methylenetetrahydrofolate reductase 


ADK3 


adentylate Icinase (GTP) 


NADH6 


NADH dehydrogenase (ubiquinone-8 


AGMT 


agmatinase 


NDPK 


nucleoside-diphosphate kinase 


AKGDH 


2-Oxogluterate dehydrogenase 


NDPK3 


nucleoside-diphosphate kinase (ATP:CDP) 


ALAR 


alanine racemase 


NDPK5 


nucleoside-diphosphate kinase (ATP:dGDP) 


ALARi 


alanine racemase (irreversible) 


NDPK7 


nucleoside-diphosphate kinase (ATP:dCDP) 


ARGDC 


arginine decarboxylase 


NDPK7 


nucleoside-diphosphate kinase (ATPidCDP) 


ASNS 


asparagine synthase (glutamine-hydrolysing) 


NDPK8 


nucleoside-diphosphate kinase (ATP:dADP) 


ASNS2 


asparagine synthetase 


02t 


O2 transport (diffusion) 


ASPTA 


aspartate transaminase 


ORNDC 


Ornithine Decarboxylase 


BIOMASS 


Biomass reaction 


PDH 


pyruvate dehydronase 


CBMK 


Carbamate kinase 


PGK 


phosphoglycerate kinase 


CBPS 


carbamoyl-phosphate synthase (glutamine-hydrolysing) 


Plabc 


phosphate transport via ABC system 


C02t 


CO2 transporter via diffusion 


PIt2r 


phosphate reversible transport via symport 


CS 


citrate synthase 


PPC 


phosphoenolpyruvate carboxylase 


DB4PS 


3,4-Dihydroxy-2-butanone -4-phosphate 


PTAr 


phosphotransacetylase 


DHORD2 


dihydoorotic acid dehydrogenase (quinoneS) 


RBFK 


riboflavin kinase 


DHORD5 


dihydroorotic acid (menaquinone-8) 


RBFSa 


riboflavin synthase 


DKMPPD 


2,3-diketo-5-methylthio-l-phosphopentane degradation reaction 


RBFSb 


riboflavin synthase 


DKMPPD2 


2,3-diketo-5-methylthio-l-phosphopentane degradation reaction 


RNDR 


ribonucleoside-diphosphate reductase (ADP) 


EX urea 


trans-system-boundary (irreversible) 


RNDR2 


ribonucleoside-diphosphate reductase (GDP) 


FMNAT 


FMN adenylyltransferase 


RNDR3 


ribonucleoside-diphosphate reductase (CDP) 


FRD2 


fumarate reductase 


RNTR 


ribonucleoside-triphosphate reductase (ATP) 


FUM 


fumarase 


RNTR2 


ribonucleoside-triphosphate reductase (GTP) 


GALU 


UTP-glucose- I-phosphate uridylyltransferase 


RNTR3 


ribonucleoside-triphosphate reductase (CTP) 


GALUi 


UTP-glucose-1 -phosphate uridylyltransferase (irreversible) 


RPE 


ribulose 5-phosphate 3-epimerase 


GAPD 


glyceraldehyde-3-phosphate dehydrogenase 


SUCCt2b 


Succinate efflux via proton symport 


GARFT 


phosphoribosylglycinamide formyltransferase 


SUCDli 


succinate dehydrogenase 


GART 


GAR transformylase-T 


SUCD4 


succinate dehyrdogenase 


GHMT2 


glycine hydroxymethyltransferase 


SUCOAS 


succinyl-CoA synthetase (ADP-forming) 


GLUDy 


glutamate dehydrogenase (NADP) 


TALA 


transaldolase 


GLUSy 


glutamate synthase (NADPH) 


THDPS 


tetrahydropicolinate succinylase 


GND 


phosphogluconate dehydrogenase 


THRAr 


Threonine Aldolase 


HC03E 


HCO3 equilibration reaction 


THRS 


threonine synthase 


HSK 


homoserine kinase 


TKT 


transketolase 


HSST 


homoserine 0-succinyltransferase 


TKT2 


transketolase 


ICDHyr 


isocitrate dehydrogenase (NADP) 


TRPS 


tryptophan synthase (indoleglycerol phosphate) 


ICL 


Isocitrate lyase 


TRPS3 


tryptophan synthase (indoleglycerol phosphate) 


KAS14 


b-ketoacyl synthase 


UREAt 


Urea transport via facilitate diffusion 


KAS15 


b-ketoacyl synthase 


YALTA 


valine transaminase 


MALS 


malate synthase 


VPAMT 


Valine-pyruvate aminotransferase 


MDH 


malate dehydronase 
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Figure Legends 
Fig. 1 

A subgraph of the citrate cycle in E. coli metabolic network. In Boolean scheme, 
metabolites(ellipses) are treated as Boolean disjunction(OR), while the reactions(rectangles) 
as conjunction(AND). If this graph were isolated, though actually not the case, from the 
other reactions and metabolites, the metabolite coa would be no more supplied only 
when both the reactions SUCOAS and CS are blocked. On the other hand, the reaction 
AKGDH would not be operational when either of the metabolites coa or akg is knocked 
out. In this hypothetical subnetwork severed from the other part, bold red arrows indi- 
cate blocked reaction paths due to the knockout of the reaction C S . Defining avalanche 
size as the number of reactions subsequently turned off on account of the removal of a 
target reaction, reaction CS has avalanche size as big as three. 

Fig. 2 

Normalized cumulative lethality score(CLS) with respect to the wild-type flux levels 
grown on distinct carbon sources, where 0(1) in the ordinate corresponds to the high- 
est(lowest) ranker in the flux level. CLS against flux level shows no remarkable convex- 
ity, if any, implying they cannot be a lethality measure. 

Fig. 3 

Normalized cumulative lethality scores drawn for the avalanche size of single and mul- 
tiple deletion in the Boolean version of the E. coli metabolic network, where 0(1) in the 
ordinate corresponds to the highest(lowest) ranker in the flux level. Inset: Boolean 
avalanche size distribution under single targeted deletion of each reaction follows a 
power-law with the (noncumulative) exponent 2.5. 

Fig. 4 

Predicted lethal (red circles) and nonlethal (blue crosses) reactions in the avalanche size 
and the wild-type flux level determined by FBA (main frame) and the Boolean scheme 
(inset). In both the FBA and Boolean schemes, lethal reactions have larger avalanche 
size and are dispersed along wide range of flux level at the same time. Here, only the 
reactions being turned on in the wild-type strain are plotted. 

Fig. 5 

(a) As a consequence of NDPK7(nucleoside-diphosphate kinase, ATP:dCDP) removal, 
the reaction RNDR3(ribonucleoside-diphosphate reductase) is also blocked(dotted red 
lines), but RNTR3(ribonucleoside-triphosphate reductase) is newly turned on(solid (sky- 
)blue lines), with NDPK3(nucleoside-diphosphate kinase, ATP:CDP) flowing more flux 
to make up for the growth flux. Here, only three reactions are retuned, two of which 
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depicted by filled blocks constitute a synthetic-lethal pair in both the Boolean and FBA 
schemes. The flux changes are shown in units of mm/g DW-hr. (b) E. coli can subsist 
without the reaction SUCD4, but, in the absence of SUCD4, aU the reactions DB4PS, 
RBFSa, RBFSb, RBFK, and FMNAT become lethal. In particular, DB4PS has a 'long- 
range correlation' with SUCD4 along with the intermediary lethal reactions. Other parts 
of the network that the avalanche cascade does not reach are omitted for brevity. 

Fig. 6 

Synthetic lethal pairs tend to be closer neighbors than arbitrary pairs of reactions. Here, 
ALL refers to all pairs of reactions, while BOOL(FBA) stands for the Boolean(FBA) 
synthetic-lethal pairs. In the main histogram, distance between a pair of metabolites is 
measured without counting the connection via currency metabolites or cofactors, while 
the inset is obtained by counting every conections. Reactions sharing either substrates or 
products are regarded unit distance apart from each other. Across the analytical schemes, 
almost all (>95%) synthetic-lethal pairs are only three steps apart and below, in sharp 
contrast with the broad distribution of distances between arbitrary pairs. See the text for 
details. 

Fig. 7 

Classification of lethal and synthetic-lethal reactions according to the functional cate- 
gories. Frequencies are normalized by total number of lethal singlets and doublets in 
both the Boolean and FBA scheme, respectively. 
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Table Legend 
Table 1 

List of synthetic-lethal reactions. Upper(lower) two cells include synthetic lethals be- 
longing to the same(distinct) functional categories, and the wild-type flux of each reac- 
tion is given in units of mm/g DW-hr. 
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